Intelligent Process Supervision Using Renforcement Learning and Temporal Abstraction

نویسنده

  • Ernesto Martinez
چکیده

Supervisory control usually involves timely switching among different courses of action over multiple time scales. In this work, intelligent process supervision is addressed in the context of semi-Markov decision processes and reinforcement learning. Temporally extended actions that represent a way of behaving together with a termination condition are used to achieve a set of operational goals/sub-goals comprising a supervision task. The control strategy resorts to a hierarchy of macro-actions or options which are made up of closed-loop sequences of low-level, primitive actions. Supervisory control of a buffer tank is discussed as a representative example. Copyright © 2002 IFAC

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Apprentissage par renforcement pour les processus décisionnels de Markov partiellement observés Apprendre une extension sélective du passé

We present a new algorithm that extends the Reinforcement Learning framework to Partially Observed Markov Decision Processes (POMDP). The main idea of our method is to build a state extension, called exhaustive observable, which allow us to define a next processus that is Markovian. We bring the proof that solving this new process, to which classical RL methods can be applied, brings an optimal...

متن کامل

Hierarchical Functional Concepts for Knowledge Transfer among Reinforcement Learning Agents

This article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for Reinforcement Learning agents. These definitions are used as a tool of knowledge transfer among agents. The agents are assumed to be heterogeneous; they have different state spaces but share a same dynamic, reward and action space. In other words, the agents are assumed t...

متن کامل

Towards Intelligent Execution Supervision for Flexible Assembly Systems

Research results concerning error detection and recovery in robotized assembly systems, key components of flexible manufacturing systems, are presented. The approach to the integration of services and the modelling of tasks, resources and enviroment is described. A planning strategy and domain knowledge for nominal plan execution and for error recovery is presented. A supervision architecture p...

متن کامل

Control of Multivariable Systems Based on Emotional Temporal Difference Learning Controller

One of the most important issues that we face in controlling delayed systems and non-minimum phase systems is to fulfill objective orientations simultaneously and in the best way possible. In this paper proposing a new method, an objective orientation is presented for controlling multi-objective systems. The principles of this method is based an emotional temporal difference learning, and has a...

متن کامل

Emotional Learning Based Intelligent Controller for MIMO Peripheral Milling Process

During the milling process, one of the most important factors in reducing tool life expectancy and quality of workpiece is the chattering phenomenon due to self-excitation. The milling process is considered as a MIMO strongly coupled nonlinear plant with time delay terms in cutting forces. We stabilize the plant using two independent Emotional Learning-based Intelligent Controller (ELIC) in par...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002